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ABSTRACT 



This initial study investigates the feasibility of using 
weather map types to forecast tropical storm/typhoon movement 
in the Western North Pacific Ocean. Fleet Numerical Oceanog- 
raphic Center's hemispheric D-value analysis fields at 850, 

700 and 500 mb are interpolated to a standard grid oriented 
relative to the present surface tropical storm position. Em- 
pirical Orthogonal Function (EOF) analysis is used to repre- 
sent the fields in terms of only 10 coefficients in an effort 
to separate true synoptic map features from random noise. A 
least squares approach is then applied to determine character- 
istic synoptic patterns relative to the tropical storm, and 
to relate each individual case to a "map type" . An analog 
type approach is used to forecast the 30- , 36-, 54-, 60-, 78- 
and 84-hour positions. All storms determined to be of the 
same map type as the candidate storm were considered to be 
analogs, and their storm tracks were rotated to have the same 
past 12-hour movement. Forecast errors ranged from 135 n.mi. 
at 30 hours to 490 n.mi. at 84 hours. Although no indepen- 
dent cases were tested, and the techniques employed have not 
been optimized, the mean vector errors indicate that the basis 
technique warrants further investigation. 
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I. 



INTRODUCTION 



Neumann (1979) states that the temporal and spatial 
analysis of tropical cyclone paths show a tendency to be re- 
petitive, and to be associated with identifiable weather 
patterns. Experienced tropical cyclone forecasters routine- 
ly examine the synoptic fields for the position of the sub- 
tropical ridge, any weakness in the intensity of the ridge, 
locations of migratory troughs in the mid-latitudes and any 
other synoptic features that may give indications of future 
storm movement. In a series of lectures at the Naval Post- 
graduate School, Dr. William M. Gray, Colorado State Univer- 
sity, indicated that much more can be done to examine how 
tropical storms react to their environment. In addition, he 
stated that more immediate gains in forecasting storm move- 
ment would come from examining the surrounding synoptic 
fields than from detailed modeling of the inner core of the 
storm. He bases this assessment on the fact that for at 
least the foreseeable future, detailed data in the immediate 
vicinity of the storm will not be available. Furthermore, 
with continued advances in satellite sensors and satellite 
interpretation, data will be available to better define the 
environment around the storm, if not the inner circulation. 
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This research is an initial study to determine the feasi- 
bility or applying a synoptic map-typing scheme to the prob- 
lem or forecasting tropical storms in the Western North 
Pacific Ocean. The basic concept is that tropical storm 
movement is directly related to the forcing of the general 
synoptic height field in which it is embedded, and that this 
forcing can be represented by a set of map types. Because 
of time limitations, the techniques applied could not be 
optimized. Rather, the aim is to demonstrate that the basic 
concept is sound, before perfecting the technique and adding 
the detailed refinements typical of other analog schemes. 

Analog techniques typically involve relating the current 
storm with historical storms that are temporally and spa- 
tially similar. Among the parameters used in the Atlantic 
Ocean by Hope and Neumann (1970) and in the Pacific Ocean by 
Jarrell and Somervell (1970) are distances to the subtropi- 
cal ridge and to the nearest trough. Instead of deriving 
such subjective parameters, this study directly uses the 
synoptic fields to objectively determine which relationships 
are the important ones. Tse (1966) attempted a similar 
study, in which he subjectively developed five groups of map 
types. He successfully used these to stratify his sample 
cases prior to developing a pressure gradient nomogram to 
predict typhoon movement. 
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In ~his study, analyzed D-value (deviations from stan- 
dard heights at specific pressure levels) fields were ob- 
tained from Systems and Applied Sciences Corporation, 
Monterey, CA. A total of 5C4 individual cases, from 1 Janu- 
ary 1S67 to 31 December 1375, were interpolated to a stan- 



dard grid (see Fig. 1). Since map types must be constructed 
from synoptic situations relative to the tropical storm, 
this grid had to be movable in space so that the storm is 
always located in the same position. In each case the S53, 
700 and 500 mb fields were utilized to determine which 
specific level provided the best steering information. 
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The standard grid used to plot the D-value 
fields. The standard position of the storm 
on every map is at position 70. 
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empirical Orthogonal Function (EOF) analysis was used 
for iwo reasons. First, it can be used to separate the ran- 
dom noise from the true synoptic patterns. Secondly, it can 
be used to reduce the number of descriptors required to de- 
scribe the synoptic field from the 120 grid points to a 
smaller number of EOF coefficients. Map types were selected 
using least squares sratistics from the EOF representations 
of the E-value fields surrounding the 504 storm cases. The 
basic procedure is analogous to that of Lund (1963). Each 
of the historical fields is then screened a second time to 
assign it to one of the selected map types. 

As a preliminary test of the usefulness of this map 
typing, a forecast position is derived using archived best- 
track storm positions. Each forecast storm position is 
derived from an unweighted average of best-track positions 
(rorated so that each case has the same movement for the 
previous 12 hours) for all the other storms within the map 
type. This procedure is applied to all three levels and 
six forecast time intervals. The results are then used to 
support the hypothesis that storm movement is related to 
the surrounding synoptic height field. Finally, several 
suggestions are made for further improvements to this tech- 
nique, and for using the basic concept to develop another 
type of analog scheme. 
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II. DATA ACQUISITION AND EMPIRICAL 
ORTHOGONAL FUNCTION ANALYSIS 



A. STORM SELECTION 

Data from a ten-year period from 1 January 1967 to 31 
December 1976 form the sample. All storms between 1 October 
1972 and 31 December 1972, and 7 September 1973 to 31 Decem- 
ber 1973 had to be omitted because the D-value fields were 
not available. Only two historical data sets, the best- 
track storm data and the upper-level D-value analysis 
fields, were required. 3oth sets were available from the 
Fleet Numerical Oceanographic Center's (FNOC) archive files. 

The actual selection of the 504 case studies was made 
on the basis of the following criteria: 

(1) The storm had to be at least tropical storm 
strength, with a maximum wind speed in excess of 35 knots; 

(2) The storm had to have at least 12 hours of prior 
track and persist for at least 30 hours; 

(3) Only 00 GMT and 12 GMT initial storm positions were 
considered since upper-level analysis fields are only avail- 
able at these times; 

(4) Each case had to have all three levels C 8 5 0 , 700 
and 500 mb) of data; 
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(o) To insure independent samples of gridded D-value 
maps relative to the same storm, each case had to be at 
least 35 hours apart. However, this restriction did not ex- 
clude data rrom a second storm occurring on the same date, 
because each case would be gridded according to its own 
location, and would therefore represent independent samples; 
and 

(6) All storms located south of 10° N lat were elimi- 
nated since that would extend the grid into the Southern 
Hemisphere . 

A standard grid was required that could be relocated 
based on the position of the tropical storm. Neumann (1979) 
presents a grid that has been frequently used in Atlantic 
trcpical cyclone forecast models. We selected a similar 
8 x 15 grid and positioned the storm center at the 70 posi- 
tion (see Fig. 1), since most storms have a west-northwest 
movement . The storm center and grid were located in the 
FNOC 63 x 63 hemispheric analysis fields and the D-values 
were interpolated to the grid points using the BSSLG sub- 
routine from the FNOC computer library. This subroutine 
interpolates a data field of dimensions (M x N) for a point 
using Bessel's central difference formula. 

Best-track storm positions are collected every 5 hours, 
beginning 12 hours prior to the forecast time and extending 
to + 84 hours. Since upper-level data fields are only 
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available at 00 GMT and 12 GMT, map typing can be done only 
twice each day. Because the analysis fields are not avail- 
able until about 03 GMT and 15 GMT, a 24-hour forecast from 
map time would only be an 18-hour forecast by the time the 
Joint Typhoon Warning Center (JTWC) could use it to orepare 
a warning. On the basis of these considerations, forecasts 
are made at 30, 36, 54, 60, 78 and 84 hours from analysis 
time, with new forecasts available for the 06 GMT and 18 GMT 
warnings (see Fig. 2). In Table I, the 504 cases are broken 
down by the number of cases in the sample with sufficient 
best-track data to be useful in developing the analog 
forecast positions. 

An alternate approach of using the most recent upper- 
level analysis with the current storm position (+6 or +12 
hours from analysis time) was considered unsatisfactory for 
two reasons. First, there would always be a discrepancy be- 
tween the present storm position and the actual position of 
the low pressure center in the map. Elimination of rhis 
discrepancy would require smoothing in the region of the 
prior storm position and the introduction of a bogus storm 
at the present position. Secondly and most importantly, the 
synoptic conditions to the north of the storm that may affect 
storm motion are generally moving in the opposite direction 
of the tropical storm (.see Fig. 3). Therefore, the synoptic 
pattern north of the storm would generally be translating 
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scheduled warnings issued by JTWC each day. 

The X marks upper level analysis time; o 
represents time that forecasts would be avail- 
able from this research, * represents the time 
warnings are issued; and $ represents warning 
forecast times. This diagram illustrates the 
reason for the 30, 36-, 54-, 60- , 78- and 84- 
hour forecast intervals. 



TABLE I 



Number of cases available per forecast 



time interval 



t Interval 


Number of 


30 


504 


36 


482 


54 


387 


60 


360 


78 


273 


80 


250 



Cases 
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^iuch . aster than the tropical storm and the synoptic feat! 
south of the storm. 




Figure 3. 



This is a very generalized picture of the 
flow as it appears on the grid. Note the 
difference in speed, relative to the storm, 
of features to the north of the storm 
compared to features south of the storm. 



3. DATA NORMALIZATION 

Having interpolated the data to the 120 standardized 
grid points for each of the 504 maps, the data must be 
arranged into a matrix for the EOF analysis. Each column 
represents a map type and each row represents a grid poin* 
The result is a 120 x 504 matrix. The data must be 



ires 
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normalized at each point to use the EOF method of extracting 
"signal ' rrom "noise". This "noise" comes primarily from 
errors in the observations used to construct the D-value 
fields and from subgrid scale phenomena. The normalization, 
performed by rows , is 






x. . - x 

ip : 

s . 

D 



( 2 . 1 ) 



where i represents the map number; j represents the grid 
point; represents the standardized D-value, x 4 represents 

the mean (over i) of the grid-point values j ; s^ represents 
the standard deviation at grid-point j ; and x^^ represents 
the D-values of the ith map at grid-point j. 

Two other normalization procedures were tried: 1) nor- 

malizing each map (i.e. column), using x,- and s ^ and 2) nor- 
malizing over the whole matrix using x and s, as constants. 
Each method emphasized certain aspects of the data field at 
the expense of another. Fig. 4 and Fig. 5 are examples of 
the results of the different normalizing schemes after the 
EOF computations and truncation. Normalization by grid point 
(as in (2.1)) insures that the variance at each grid point 
is given equal weight and emphasizes the features in the 
southern portion of the maps. Normalization by map insures 
that the variance of each map is given equal weight. Since 



most of the variance, within each map, is usually in the 
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Figure ‘l . The effect of the different normalization schemes at 500 mb. The 
four representations are for 00 GMT, 18 Mar 1967. A is the actua 
field; B, C and D are the EOF representations (truncated at 10 
terms) using grid-point normalization, whole field normalization 
and map normalization, respectively. The isopleths are D-values 



aa 
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igure 5. Similar to Fig. 4 except for 12 GMT 25 Aug 19G7. 



northern region, this method generally emphasizes the north- 
ern regions, but not always. Normalization by the entire 
field insures that the variance of each point of each map is 
given equal weight. The effect of this method varies from 
map to map and depends on how the variance of the map is dis- 
tributed relative to the average variance over all cases. 

In rig. 4 A and Fig. 5A, the actual map data, first note 
the detail and then the "noise" in the grids, primarily 
represented by the unsmoothed isopleths. Then note how much 
smoother the truncated eigenvector fields are regardless of 
the normalization scheme. In Fig. 4B and Fig. 5B , the grid- 
point normalized maps, the definition and strength of the 
troughs in the westerlies in the northern portions of the 
grid are poor. However, the tropical sections of the grid 
are represented well. Particularly note the low pressure in 
the southwest corner of Fig. 5B that was poorly analyzed in 
both Fig. 5C and Fig. 5D. The maps normalized by the entire 
map's mean and standard deviation, Fig. 4C and Fig. 5C, have 
good representations of troughs in the westerlies. However, 
they miss the intensity of the low pressure area in the 
southwest section of Fig. 50. Almost the same comments can 
be made for Fig. 4D and Fig. 5D as for Fig. 4C and Fig. 5C. 
The method of normalization by grid points used here Ceqn. 
(2.1)) was selected to keep the large variance in the north- 
ern part of the grid from overpowering the variance in the 
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southern part. Also by following xhis method, follow-on 
studies will be able to use the Preisendorfer and 3arnett 
(1977) method for truncating the EOF series. In Fig. 6, 
the maps of the 350, 700 and 500 mb grid-point means and 
standard deviations, that are used to normalize the grid 
via (2.1), are presented. Particularly note the north- 
south variations of the means and the large standard devia- 
tions in the northern portions of the maps at all levels. 
There is also a relative maximum in the standard deviations 
at the location of the tropical cyclone, which indicates a 
sizable variation in storm intensity from map to map. 

C. EMPIRICAL ORTHOGONAL FUNCTION (EOF) ANALYSIS 

The concept of using EOF analysis in meteorology was 
first introduced by Lorenz (1956). His description formed 
the basis for a wide range of uses of the technique in the 
meteorological field. Kutzbach (1967) provides a concise 
mathematical and descriptive outline of the technique upon 
which this research is based. The object of using the EOF 
technique is two-fold. First, it is used in an attempt to 
separate the true "signal" from random "noise" in the D- 
value fields. Secondly, it provides a method to reduce the 
number of variables from a large number of highly correlated 
variables to a smaller number of independent variables. 

Only a general description of the concept and procedure is 
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Figure 6. The D-value means Cleft side) and standard 

deviations Cright side) for 850 mb (bottom) , 
700 mb Cmiddle) and 500 mb (top). The 
isopleths are D-values Cm) . 
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included here. 



A more detailed mathematical description of 
the process is presented in Appendix A. A listing of the 
computer subroutines used in the analysis is given in 
Appendix 3. 

A 120 x 504 matrix X was formed, such that each grid 
point in each map is normalized in relation to that grid 
point over the 504 maps. Each map may be thought of as a 
vector in 120-dimensional space. The 504 vectors represent- 
ing each map will tend to be highly correlated. Therefore, 
it is expected that the points representing the position of 
the head of the 504 vectors will tend to cluster in space. 
The objective of the EOF analysis is to use the 504 points 
to determine a set of 120 orthogonal vectors to represent 
the basis of the 120-dimensional orthogonal space. To de- 
termine this 120-dimensional space, first form the symme- 
tric normalized covariance matrix. The columns are vectors 
representing the variance in the data matrix. The first 
axis in this new space, hereafter referred to as the first 
eigenvector, will be determined such that it will be a li- 
near best fit to the variance vectors. This procedure is 
equivalent to maximizing the explained variance (eigenvalue) 
and minimizing the unexplained variance (residual) of the 
data by the first eigenvector. The second eigenvector will 
again be a linear best fit to the variance vectors, with the 
additional requirement that it be orthogonal to the first. 
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Thus , this eigenvector will explain as much of the remaining 
variance as possible. The third eigenvector will again be 
the linear best fit to the variance vector, but it must now 
be orthogonal to the first two eigenvectors. The procedure 
will be continued until all the variance is explained, which 
will be equivalent to having an eigenvector for each of the 
120 dimensions. 

One may think of EOF analysis as being analogous to a 
Fourier series. Instead of representing the data as a linear 
combination of space-orthogonal sines and cosines, the space- 
orthogonal functions in the EOF analysis are derived from the 
actual variance in the data. As with Fourier analysis, each 
eigenvector has an associated (time) coefficient. This co- 
efficient, when combined with its eigenfunction and summed 
over the entire eigenspace , provides an exact reproduction 
of the original field. That is, 



M 

XCx,tX = Y'' EOF (x>- c (t). (2.2) 

7 J m m 

m= 1 

where X represents the value at grid-point x of map t in the 

data matrix; EOF represents the empirical orthogonal func- 

m 

tions which are derived by relating the unit eigenvectors to 
the original 120-dimensional space; and C are the time 



26 



er- 



coefficients (in this case time is determined by the diff 
ent map times) related to the m th eigenvalue, and M = 120. 
These coet f icaent s are time orthogonal (Kutzbach, 1967) and 
are obtained from the eigenvectors and the original data 
field. The specific mathematical relationships are described 
m Appendix A. When a linear combination of eigenvectors 
and their respective time coefficients are translated into 
the original space , the eigenvectors represent a series of 
functions in the same manner as sines and cosines represent 
functions in Fourier analysis. A truncated linear combina- 
tion (i.e. (2.2) with M less than 120) can be shown to 

approximate the original maps with the "noise" removed 
( Preisendorf er and Barnett, 1977). 

Upon completion of the EOF analysis, the eigenvectors 
and eigenvalues represent solutions to the maximization of 
an "explained variance" problem. Thus, depending on the 
geophysical parameter being analyzed and the quality of the 
data available, the first few eigenvectors often represent a 
large fraction of the variance in the data set. Preisen- 
dorfer and Barnett (1977) have developed an empirical 
formula for separation of true "signal" from "noise" in EOF 
analysis. Since each eigenvector represents a weighted por- 
tion of the variance, this separation can be accomplished 
by truncating the number of eigenvectors included in the EOF. 
Because of time constraints for this study, an arbitrary 
truncation was made at eigenvector 10. Without formal 
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application of the Preisendorf er-Barnett techniques, we can 
not be assured of the statistical significance of the higher 
order eigenvalues. This is one aspect that needs to be 
optimized in any subsequent studies. 

Because most of the variance is contained in the first 
tew eigenvectors, excellent "noise-free" representations can 
be made from just a few of them. The cumulative percent of 
the variance expressed by the linear combination of eigen- 
vectors derived from the 504 case studies is shown in Table 
II. For example, 83-85 percent of the variability is ex- 
pressed in the first 10 eigenvectors at all three levels. 

In fact, beyond a certain point, the eigenvectors represent 
such a small portion of the variance that they just repre- 
sent noise in rhe map fields. In this manner we can reduce 
the number of variables required to represent the maps from 
120 grid points to 10 coefficients of the orthogonal eigen- 
vectors. In the next section, we use These 10 coefficients 

(C (t)) to derive a series of characteristic maDs which we 
m 

will call map types. 

The maps in Fig. 7 represent case number 1 and are an ex 
ample of how the linear combinations of the EOF build the 
map. The last map in Fig. 7 is the same as shown in Fig. 4A 
Without describing in detail each map in Fig. 7, it can be 
seen that the more drastic changes in the pattern occur in 
the belt of westerlies after the inclusion of more than 10 



28 



TABLE II 



Cumulative percent of variance and the actual variance 
(m ) or eigenvalue as expressed by the linear combina- 
tion of eigenvectors for each level. An arbitrary 
cutoff at 10 eigenvectors was used here. 



Eigen 


850 


mb 


700 


mb 


500 


mb 


vector 


Percent 


Variance 


Percent 


Variance 


Percent 


Varianc 


1 


34 


40.15 


25 


29 .53 


33 


39.38 


2 


51 


20.35 


46 


25. 95 


51 


21.61 


3 


60 


11.09 


56 


11.17 


59 


9 .32 


u 


67 


8.42 


62 


7 . 91 


65 


7 .45 


5 


72 


6 . 09 


68 


7.11 


70 


6.05 


6 


75 


4.14 


72 


5 . 14 


75 


5 .29 


7 


78 


3 . 42 


76 


3 .77 


78 


3.98 


3 


31 


2 . 94 


79 


3 .59 


81 


3.11 


9 


83 


2 . 56 


81 


2.57 


83 


2 .49 


10 


85 


2.16 


83 


2 . 29 


85 


2 . 11 


11 


36 


1.74 


84 


1.98 


86 


1.86 


12 


87 


1. 52 


86 


1.61 


88 


1.71 


13 


88 


1.28 


88 


1.41 


89 


1 .41 


14 


89 


1.05 


89 


1.25 


90 


1.19 


15 


90 


.95 


90 


1.13 


91 


1.05 


15 


91 


. 89 


91 


1.06 


91 


. 95 


17 


92 


. 84 


91 


.97 


92 


. 78 


18 


92 


.65 


92 


. 74 


93 


. 77 


19 


93 


. 57 


92 


.62 


93 


.69 


20 


93 


.55 


92 


.61 


94 


.58 


120 


100 




100 




100 





to 15 terms. With just one term, the basic flow is estab- 
lished - moderate westerlies, weak gradient in the cropics 
and a trough where the storm is located. After including abou 
15 terms only slight refinements of the pattern take place. 
The "noise" in the map starts to become more obvious with the 
inclusion of 40 terms. The isopleths begin to show irregu- 
larities and are not as smooth as in the earlier maps. Be- 
yond 80 terms it is almost impossible to see any differences. 
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Figure 7. An example of the effect of truncation of the 
EOF for 00 GMT, 18 Mar 1967. The number above 
the map is the number of terms included (the 
truncation point). The isopleths are D-values (m). 
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Figure 7. continued 
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III. MAP TYPING 



At this point , three choices are available. The EOF 
analysis could be treated just as an approximate method to 
eliminate the noise from the data. Then "noise-free" maps 
could be produced and map typed exactly as in Lund (1363). 
The second choice , and the one followed here , was to use the 
truncated EOF analysis to compare the maps in eigenvector 
space. The actual map typing procedure is similar to the 
Lund procedure, except that instead of having 120 variables, 
there are only 10 in the truncated eigenvector space. 

A third possibility is to consider the eigenvectors as 
the map types. The grid-point normalized fields could be 
compared in either a least squares approach or a direct cor- 
relation approach, to the actual eigenvectors. This method 
was net chosen in this study since the statistical signifi- 
cance of each eigenvector was not known ( Preisendorf er and 
Barnett, 1377). By using the coefficients, and therefore 
essentially the approximated data sets, we hoped to minimize 
the problem of not knowing the statistically derived 
truncation point. 

As described previously, the eigenvectors represent the 
axes in space. It is the coefficients that actually locate 
the position of each individual map in that space. Although 
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each map is an independent sample, the maps are highly cor- 
related and will when plotted in space tend to form clusters, 
ihe idea is to find the map that is most representative of 
each cluster and call that the map type. To do this, each 
map is compared in the least squares sense to all the other 
maps. This is done by using the coefficients of the trun- 
cated eigenvector space in the following manner 

10 

LS jk = Z <c i3 - C ik >2/S i 2 
i=l 



where LS^ is the least squares value comparing map j with 
map k ; Ch ^ is the coefficient of the it'n eigenvector of map 

j •, is the coefficient of the ith eigenvector of map k: 

2 • 

and s is the variance accounted for by the ith eigenvector 
in the truncated eigenvector set. The LS^ value for each 
map wich all ether maps is computed and stored. 

Now, search for the map that is most representative of 
the "tighrest" cluster and designate this as map type 1. To 
do chis , the map that had the largest number of other maps 
falling within the least squares radius of five units was 
designated map type 1. The least squares radius of five 
units was selected arbitrarily on the basis of a trial and 
error method. Since map type 1 was the most common map, it 
should have the largest number of maps associated with it, 
but no so many that the sample is so depleted that the 
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other map types can not be determined. The least sauares 
radius of five units appeared to give the most representative 
sets. It is possible that a more rigorous approach for find- 
ing a statistically valid value of the cluster radius could 
be determined. 

Type 1 and all maps selected within the least squares 
radius of five were removed from the sample prior to the 
selection of the next map type. Since it is anticipated that 
the next cluster would not be as tight as the first, the 
least squares radius was increased by two units. Map type 2 
is then selected as the map with the largest number of remain- 
ing maps within a least squares radius of seven units. Then 
map type 2 and all maps within seven units of it were removed 
from one sample. The process, with an increasingly larger 
radius, is continued until at leasx 470 maps were selected. 

Finally, all the remaining maps are lumped together in 
the last type. The actual map type is selected in the same 
manner as all the others , except that now all the remaining 
maps are placed with this type regardless of the value of the 
least squares radius. 

When this process was completed, there were 11 or 12 map 
types for each level, each with a number of other maps simi- 
lar to it. As the typing process evolved, there was no 
attempt to keep the least square radius circles from over- 
lapping. Thus, the map typing is not complete, because a 
map in type 1 may actually be closer to map type 3 than it 
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is to map type 1. Therefore, each map must again be compared, 
in the least squares sense , to each of the selected , most 
representative map types. Then each map is grouped with the 
type it is nearest to in space. Now the map typing is com- 
plete. Only the results of the 500 mb map typing are dis- 
cussed here. Similar comments could be made for the 850 and 
700 mb map types, but in the interest of brevity just the map 
types and forecast results are presented in Appendix C. 

Each of the eleven 500 mb map types in Fig. 3 is clearly 
different. The following is a discussion of the distinguish- 
ing synoptic features for each map type: 

Map Type 1: The storm is situated moderately deep in the 

tropics. To the west of the storm the pressure gradient is 
very weak. The broad ridge to the east of the storm is also 
weak. Well to the north of the storm the pressure gradient 
of the westerlies is weak with troughing indicated to the 
north-northwest of the storm at a distance of approximately 
700 n.mi. 

Map Type 2: Here the storm is deep in the tropical belt 

with a strong east-west subtropical ridge centered approxi- 
mately 600 n.mi. almost due north of the storm. A weak ex- 
tension of this ridge separates the storm from the broad low 
1300 n.mi. to the west of the storm. To the east of the 
storm the influence of the strong subtropical ridge extends 
to the southern boundary of the map. 
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Figure 8. The 500 mb map types. The number above 
each map refers to the map type number, 
i.e. in the order selected. Isopleths 
are D-values Cm) . 
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Figure 8. continued 
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\ap xype 3 : In Phis case Phe sPorm is near Phe norPh- 

ern boundary of Phe Propical region, wiPh weak pressure 
gradienPs Po Phe wesP and wesP-souPhwesP . The cenPer of a 
sProng cell aP Phe easPern exPremiPy of Phe subPropical 
ridge is evidenP approx imaPely 1200 n.mi. Po Phe easP-norPh- 
easP of Phe sPorm. The sProng wesPerlies PhaP begin only 
600-700 n.mi. Po Phe norPh of Phe sPorm furPher inPensiPy 
Po Phe norPh of Phis ridge cell. There is a weak Prough Po 
Phe wesP-norPhwesP . 

Map Type 4: Here Phe sPorm is only 300-400 n.mi. from 

Phe sProng wesPerlies. IP is locaPed bePween Pwo cells of 
Phe subPropical ridge wiPh Phe sPronger cell due easP aP Phe 
edge of Phe grid. There is a Prough in Phe wesPerlies almosP 
due norPh wiPh Phe sPorm apparenPly aP Phe souPhern exPenP 
of Phe Prough. 

Map Type 5 : Again Phe sPorm appears po be locaPed deep 

in Phe Propics because of Phe surrounding weak pressure 
gradienp and Phe absence of Phe sProng wesPerly flow in Phe 
norPhern porPions of Phe grid. There is a broad weak ridge 
Po Phe easP of Phe sPorm. An exPension of Phis ridge can be 
seen in Phe wesPerlies far Po Phe norPh of Phe sPorm. 

Map Type 6 : In Phis case Phe presence of Phe sPorm is 

barely discernable. The implicaPion is PhaP the sPorm is 
eipher very weak or PhaP iPs influence is noP seen aP 500 mb. 
AnoPher poinP PhaP needs Po be made here is PhaP Phe daPa 
on Phe sPandard grids is inPerpolaPed direcPly from Phe 
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analysis fields as they appear on the FNOC data tape. Some 
fields may have the storm location and intensity inserted 
manually while others may not. In the scope of this re- 
search, there is no way of knowing whether or not the bogus 
data were applied to any particular map used in our data 
set. The center of the subtropical ridge is positioned only 
about 500 n.mi. to the east of the storm with its influence 
extending well to the west of the storm. The westerlies to 
the north are of only moderate intensity with a trough-ridge 
pattern in the northwest quarter of the grid. 

Hap Type 7 : Again the storm circulation is only weakly 

apparent at 500 mb by the slight deformation in the south- 
west section of a strong subtropical ridge to the northeast 
of the storm. In this case however, there is a strong mid- 
latitude trough system extending to the latitude of the 
storm to the storm's west. The westerlies through the 
trough are moderate, increasing in speed north of the sub- 
tropical ridge position. 

Map Type 8: In this case the storm's influence is felt 

strongly at 500 mb. The D-value at the storm's location is 
less than 240 m, which is more intense than all other map 
types except map type 11. The storm is located between two 
strong cells in the subtropical ridge. The pressure gradient 
to the east of the storm is much steeper than to the west 
within 600 n.mi. of the storm. The westerlies appear to be 
of moderate intensity with a well defined trough to the 
north-northwest . 
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Map Type 9 : All indications here are that the storm, as 

represented by the trough in the westerlies, has almost en- 
tirely departed the tropics. In the western portion of the 
grid the subtropical ridge has either completely broken down 
or the storm is so far north that the western part of the 
ridge is south of the grid. The ridge center to the east of 
the storm is strong, and there is a strong westerly current 
to the north of it. There is a trough in the westerlies 
almost due north of the storm. 

Map Type 10 : As with map types 6 and 7 , the storm here 

is barely discernable at 500 mb. It appears here as a break 
in the ridge. The ridge axis is oriented from east-northeast 
of the storm to almost due west of the storm and is much 
stronger in the northeastern extension. The westerlies are 
strong with a trough located north-northwest of the storm. 

Map Type 11: In this case the storm appears deeper than 

in all the others. The storm appears to be 700-800 n.mi. 
south of the subtropical ridge axis. A strong cell in the 
ridge is evident to the northwest of the storm. A mid-lati- 
tude trough is evident in the northeast corner of the grid, 
and a trough extends through a weakness in the ridge to the 
storm. The westerly flow is well removed from the storm 
(1100-1200 n.mi. 1. 

With this brief discussion of each map, it is clear that 
each represents a different synoptic situation. Fig. 9 and 
Fig. 10 show examples of map types 1 and 2 storm cases from 
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Figure 9. Examples of map type 2 (.500 mb). The letters 

above the maps are for reference for the text. 
The isopleths are D-values (m) . 
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Figure 10. Examples of map type 1 C500 mb). The letters 

above the maps are for reference for the text. 
The isopleths are D-values Cm) . 
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the sample (more examples of the other map types are shown 
in Appendix D) . Since each map had to be classified with 
one of the characteristic maps, the least squared distances 
between the individual maps and its characteristic map may 
vary considerably even within one map type. Thus, it is to 
be expected that some maps show greater similarities to 
their characteristic map than others (see Fig. 8). 

In Fig. 9, the map type 2 maps have a strong ridge to the 
north of the storm. The actual position of the primary cell 
varies from north-northwest of the storm to northeast of the 
storm. All the maps show the storm deep in the tropics. 

The intensity of the storm varies considerably. In Fig. 9, 

A and E, the storm is very weak at 500 mb, while it is very 
strong in Fig. 9, B and D. The westerlies, well to the north, 
vary in intensity but are generally fairly weak and the 
pattern is similar to the map type. All the cases indicate 
a broad area of low pressure in the tropics due west of the 
storm, except in Fig. 9C when the low pressure is to the 
west-southwest. Therefore the basic patterns are very 
similar. 

In Fig. 10, the map type 1 examples do not show as much 
similarity to their map type case as do the map type 2 ex- 
amples, but the basic patterns are still evident. All maps 
show strong evidence of the trough in the westerlies to the 
nort'n-northwest of the storm that is found in the character- 
istic map. All cases show westerlies that are weak to 



43 



moderate in intensity. The storm is always moderately deep 
in the tropics with the exception of Fig. 10E. In Fig. 10, 

B, C, D, and E, the storm is weakly represented at 500 mb as 
an the map type storm. All show the ridge basically to the 
east of the storm. Again, it is not too difficult to see 
the similarities. 

After examining Fig. 7 and noting that map type number 1 
may have needed more terms to adequately define the details 
of the synoptic features, one actual size (500 mb) with the 
EOF series truncated at 20 terms (approximately 93 percent 
of the variance) was tried. The map types selected using the 
20 EOF terms are shown in Fig. 11. The selection procedure 
was similar to that described above except that a larger 
(15 versus 5) initial least squares distance was required 
initially in determining the characteristic map types. The 
inclusion of more terms appears to better separate the maps 
in eigenvector space. The maps have considerably more detail 
and the D-values in the troughs and ridges much more closely 
resemble the actual maps. It is clear that the map typing 
is very sensitive to the number of EOF modes that are re- 
rained in the analysis. It is possible that 10 EOF terms 
may nor be sufficient to properly represent the significant 
features for several possible reasons. First the data should 
have already been somewhat filtered, since the FNOC fields 
have been objectively analyzed. 3ecause the grid size 
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Figure 11. The 500 mb map types using 20 terms of the EOF. 

The number above the maps refer to the map type 
number. The isopleths are D-values (m) . 
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Figure 11. continued 
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Figure 11. continued 
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used is not that much different from the tropical grid 
size of the analysis fields, there would appear to be little 
subgrid scale features to be removed from the grid-point 
values . 
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IV. ANALOG TRACK FORECASTS BY MAP TYPES 



With each map now typed, a forecast position is derived. 
The following procedure is simple and later can be greatly 
refined, as indicated in the next chapter. As indicated in 
the introduction, the object here was not to optimize the 
technique, but rather to see if a representative forecast 
could be made based on the map types. 

Before any comparison of forecast positions could be 
made, all the positions had to be standardized. The 0-hour 
position, or map time position, of the storm is converted to 
10° N and 140° E. This position is arbitrary and does not 
affect the results. It only serves as a reference position. 
All other best-track positions are then converted to a lati- 
tude/longitude position relative to this reference position. 
Then, each case is selected in succession. All other cases 
in its same map type serve as forecast analogs. Before the 
forecast positions are actually derived, all the best-track 
positions are rotated about the 0-hour storm position so 
that each analog has the same direction of movement for the 
12 hours prior to map time. There are no restrictions placed 
on this rotation. The equations that accomplish this 
rotation are, 

CALAT = SLAT + ( PALAT-PSLAT ) • FCST/12 
CALON = SLON + (PALON-PSLON) • FCST/12 



where CALAT and CALON represent the rotated analog latitude 
and longitude; SLAT and SLON represent the unrotated analog 
latitude and longitude; PALAT and PALON represent the prior 
12-hour analog latitude and longitude; PSLAT and PSLON 
represent the prior 12-hour current case latitude and longi- 
tude; and FCST represents the forecast interval, i.e. 30, 

36, 54, 60, 78 and 84 hours. It also assumes that changes 
in latitude and longitude by an equal increment represent a 
true distance. 

All rotated analog storm positions within the same map 
type (excluding the map for the case being forecast) are 
then averaged, and this average storm position becomes the 
forecast position for that case. These forecast positions 
are compared against the actual standardized best-track 
position. A mean error and standard deviation are computed 
for each map type and for the full 504 case sample. A sum- 
mary of the results for the full sample is provided in 
Table III. See Appendix C for the map type summaries for 
700 and 850 mb levels. 

It was hoped that this study would show that the easy to 
forecast situations could be separated from the difficult 
ones, and a quantitative estimate of forecast accuracy could 
be made. In a limited sense this can be shown in Table IV 
by comparing map type 2 and 3. The mean error and standard 
deviation for map type 2 are less than those for map type 3 
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TABLE III 





Summary 


of mean 


forecast 


errors 


(n.mi. ) 


and 




standard deviations (n.mi.) for 
type forecast. 


the analog- 


Fcst 


850 


mb 


700 


mb 


500 


mb 


Int 


Mean 


Std Dev 


Mean 


Std Dev 


Mean 


Std Dev 


30 


137.9 


93 . 3 


137 . 9 


93 . 2 


138.2 


95 . 0 


36 


175 . 6 


119.7 


175.2 


119.6 


175 . 8 


121.8 


54 


289 . 3 


180 . 1 


292 . 3 


188.1 


284.4 


183 . 5 


60 


332. 3 


207 . 3 


333.1 


218 . 1 


326 . 2 


211.4 


78 


456.4 


287. 2 


460 . 4 


282.1 


452 . 0 


288 . 9 


84 


492.0 


317. 5 


484 . 2 


321.1 


497 . 8 


338 . 1 


every 


forecast 


time interval. 


This would indicate that 



storms in map type 2 are easier to forecast (by this analog 
procedure) than those of map type 3. Based on the maps in 
Fig. 8, most forecasters would probably agree with this 
assessment. However, a test with independent data after 
optimizing the technique will give a much better indication 
of the significance of this concept. It does show that the 
potential exists. 

It is obvious from the average errors, as presented in 
Tables III and IV, that the results do not represent any 
major breakthrough. However, it must be remembered that the 
techniques have not been optimized and many refinements (see 
Chapter V) can be applied that should give better forecasts. 
In any case, the errors are not that bad when compared to 
the official forecast errors and represent a solid base from 
which to build. The official forecast errors for the speci- 
fic cases in this study are not known. A rough estimate 
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of the official JTWC warning error was computed by taking 
the mean error for each of the years included in this study, 
summing them and dividing by the number of years (10). The 
results indicated an average error of 112 n.mi. at 24 hours, 
229 n.mi. at hours, and 347 n.mi. at 72 hours. These 
average errors include all warnings issued for storms of 
tropical storm intensity for the time frame involved. 

Another question regards rhe level that gives the best 
steering motion. It is obvious from Table III that the 
data from all three levels produced remarkably similar 
errors. Why this occurred is uncertain, but it is con- 
jectured that the adjustment made for the past 12-hour 
motion strongly influenced these average error statistics. 

To further examine how the schemes at each level performed, 
a block of 20 storms was selected at random from the data 
sets and presented in Table V. The data at each level for 
these storms is remarkably consistent, which may again be 
the result of the corrections made for the past 12-hour 
movement . 
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Examples of 30-hour forecasts made at 850, 700 and 500 mb. All positions 
are relative to the standardized 0-hour position (10° N, 140° E) and 
given as lat/long (map type number). 
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V. SUGGESTED IMPROVEMENTS 



Since this was an initial study to verify that the tech- 
niques would work, several procedures were not optimized. 

The first procedure that needs this optimization is the EOF 
truncation procedure , using the P-reisendorfer and Barnett 
(1977) method. In Chapter III we noted that the 20 term 
EOF map types show considerably more detail than the ones 
attempted with 10 terms. Whether this additional detail is 
desirable or not (i.e. , whether it is "signal" or "noise") 
remains to be proved. We suspect that the motion of the 
tropical storm is primarily dependent on the large-scale 
synoptic situation, and the 10 term EOF map types may have 
represented these large-scale features well enough to dis- 
tinguish the map types. Unfortunately, it is also possible 
that some of the first 10 terms of the EOF may actually be 
"noise". Thus, there may be too many terms in the EOF. 

Table VI contains the 30-hour forecast results using the 
20 terms. The average error for the entire sample does not 
show any improvement , but it does show a wider range of 
average errors for the specific map types. Therefore, by 
using more terms it may be possible to better separate the 
easy forecasts from the difficult ones. Remember that this 
suggestion was made on the basis of the forecasts for one 
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level and one forecast interval. More data must be gathered 
before this idea can be really supported or refuted. 



TABLE VI 

30-hour forecast mean errors (n.mi.) 
and standard deviations (n.mi.) at 



500 


mb using 20 


terms of the EOF. 




Map 




Standard 


Number 


Type 


Mean 


Deviation 


of Case 


1 


141.2 


100.3 


173 


2 


125 . 5 


75.4 


49 


3 


130.1 


106 . 2 


65 


4 


110.6 


68.9 


30 


5 


119.0 


65.9 


37 


6 


170.0 


103.7 


24 


7 


133.4 


79 . 9 


19 


8 


130.2 


83.4 


22 


9 


147 . 2 


108 . 8 


17 


10 


184.3 


109.4 


15 


11 


147.7 


90 . 0 


11 


12 


2 36.7 


39.8 


4 


13 


75 . 3 


42 . 2 


11 


14 


216 . 2 


123 . 3 


6 


15 


157 . 1 


71.7 


7 


16 


145 . 6 


48.4 


6 


17 


222.4 


117.6 


6 


13 






2 


Total 


139.6 


97 . 1 


504 



Also, as indicated at the beginning of Chapter III, 
once the proper truncation point has been determined an 
attempt should be made to use the eigenvectors as the map 
types. This use of the eigenvectors should work as well or 
better than the characteristic map type approach and has 
the advantage of much fewer computations. However, this 
procedure would work only for the map-typing scheme and 
would not work for the strictly analog scheme suggested 
later in this chapter. 
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Next, if the map-typing concept is to be followed, a 
weighted average should be used to compute the forecast 
positions. The weighting function should be inversely pro- 
portional to the least squares distance from each map in 
the map type to the case being forecast. This procedure 
would give more weight to the positions from the maps that 
most closely resemble the present case. In other words, 
the more similar the maps are, the more likely the synoptic 
forcing is similar. Therefore there is a higher probability 
that the future movements will be similar. Thus the fore- 
cast should be improved. 

The past 12-hour movements should also be compared be- 
fore using all the cases in the map type to compute a 
forecast movement. Any case within the map type that does 
not have a similar past movement, within say plus or minus 
30-40 degrees, should not be included when computing the 
forecast position. The justification for this procedure is 
that if the past 12-hour movement is so much different than 
the current case, the forcing mechanisms for the two cases 
are most likely different. This could be due to rapidly 
changing patterns or to small-scale features in the synoptic 
situation that the map-typing scheme did not resolve. Later 
versions of the analog forecast scheme of Jarrell and 
Somervell (1970) use a similar restriction on permissible 
past-motion vectors in the analog scheme. 
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Another improvement to the methodology that should im- 
prove the selection of the map types is to devise a more 
statistically significant means of selecting the least 
squares distance than the trial and error method used here. 

Another more radical change to the procedure would offer 
the potential for still greater improvements. This would 
be to vary the size of the domain based on the forecast 
interval. Thus the domain would be smaller for the 30-hour 
forecast than the 34-hour forecast. In this study only one 
grid domain was used for all forecast intervals. By vary- 
ing the domain size we can select the features that affect 
the movement in the interval. For example, with the domain 
used in this study, the variance expressed by features at 
the extreme edge of the domain (2400 n.mi. away) may have 
no influence on the 30-hour movement, but may be very signi- 
ficant in the longer time frames. The proposal here would 
be to shrink the domain for the 30-hour and 36-hour fore- 
casts, in order for the EOF procedure to maximize the 
variance in the area that is most likely to affect the 
storm's movement in that time interval. 

Another possibility is to revert to a strictly analog 
technique, in which each of the 504 cases is treated as a 
potential analog. Instead of having predetermined map 
types, simply select the 10 best fits, in the least squares 
sense, to the present map. Then use these 10 best fits to 
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compute the forecast position. This would perhaps give a 
better forecast position, but it would not have the advan- 
tage of having a predetermined estimate of the accuracy of 
the forecast based on the map types. 
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VI. SUMMARY 



The objective of this initial study was to use the hy- 
pothesis that tropical storm motion is directly related to 
the forcing of the synoptic height fields , and to construct 
map types around the tropical storm that represent this 
rorcing. After this forcing, represented by the map types, 
is determined, we then use it to derive forecast storm 
positions . 

First, the FNOC analysis fields are interpolated to a 
standard grid based on the tropical storm position. Then 
the maps are organized into a matrix and an EOF analysis is 
completed. The EOF are then truncated in an effort to 
eliminate noise and the subgrid scale phenomena. The time 
coefficients of the truncated EOF are used to compare the 
maps in eigenvector space. A least squares procedure is 
used for these comparisons. The maps are highly correlated 
and will tend to form in clusters in the eigenvector space. 
3y identifying these clusters and picking the maps that are 
most representative of each of these clusters, the map 
typing is accomplished. The map types are then used to 
divide the 504 individual cases into groups. The maps in 
each group represent similar synoptic situations, which can 
be used to derive forecast storm positions. The forecasts 
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are developed by using the known best-track positions of 
the storms in each map type and rotating them so that they 
have the same past 12-hour movement as the case currently 
being forecast for. An average position from the map type 
is then used as the forecast. 

This research shows that the concept is valid. The 
procedures involved need to be optimized and refined. Map 
types can indeed be formed using the EOF analysis and least 
squares statistics. The forecast movement errors arrived 
at in this study, do not appear to be as good as the offi- 
cial forecast, but since the technique was not optimized, 
they were not expected to be as good. The fact that the 
errors in this research were as close to the official fore- 
cast errors as they were, indicates that further refinement 
of the technique is warranted. By optimizing the procedures 
and employing some of the refinements from Chapter V, this 
technique should be able to equal or better the official 
forecast . 
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APPENDIX A 



Methods for representing arrays of environmental fields 
by empirical orthogonal functions have been formulated in a 
number of different ways (for example Lorenz (1956), Frei- 
berger and Grenander (1965), Kutzbach (1967), Essenwanger 
(1976), and Little (1980)). Most of the information pre- 
sented in this appendix is based on the work of Kutzbach 
(1967) as presented by Little (1980). 

Let A be an M component vector that represents the nth 
(n=l,...,N) observation. Let A be an M by N matrix, where 
the nth column is the observation vector A . The goal is 
to determine a vector e which has the highest degree of simi- 
larity to ail the A n vectors. Kutzbach (1967) explains that 
the similarity is measured with the squared normalized inner 

Droduct of the observation vectors A and the vector e. 

n 

After averaging over all of A, the problem is to maximize 



(e'A^)N ^/e'e 

where the prime denotes 
to maximizing, 



(e'AKe'A) ' _ 

Ne'e 

the transpose. 



e ' AA ' e 
Ne'e 



( 1 - 1 ) 



This is equivalent 



e ' Be 
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where B represents the normalized symmetric (M x M) covari- 
ance matrix, and is subject to the conditions 

e'e = 1 (1-2) 

and 

B = N _1 [A' A] (1-3) 

Furthermore, Kutzbach (1967) points out that the maxi- 
mization of (1-1) subject to (1-2) leads to the equation 

Be = eA (1-4) 

The vector e and the parameter A are recognized as an eigen- 
vector and associated eigenvalue of B . It can be shown 
that the e are orthogonal and that the A^ are real and posi- 
tive. Thus for the whole matrix, (1-4) can be written as 

BE = EL (1-5) 

where E is the M x M eigenvector matrix and L is the M x M 
diagonal eigenvalue matrix. Also note that 

E'E = 1 (1-6) 

It is now assumed that the elements of L and the respective 
columns of E have been arranged such that is the largest 
X and A 2 is the ne xt largest X, etc. From (1-6) the trans- 
pose of an orthogonal matrix is equal to its inverse. Thus 

combining (1-3) and (1-5), the result is 

E'AA' = LN 
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Now derine C - E'A, where C is an M x N matrix. It follows 
that 



A = EC 



and 



M 

A n = 2 C in e i ’ n=1 N (1 - 7) 

i = l 

Thus from (1-7), the observation vector A^ can be expressed 
as a linear combination of the m eigenvectors and n C vectors 
or coeffficient vectors. If the entire E and C matrices 
are used the exact data matrix A can be reproduced. 

In most applications to meteorological fields, where the 
data vectors A are highly correlated , it is found that a 
large portion of the variance (X) can be accounted for by 
retaining only the first few of the largest eigenvalues (X) 
and eigenvectors (e). Thus by retaining only the first few 
terms of (1-7), an excellent representation of the observa- 
tion can be obtained, with the rest of the terms being con- 
sidered noise. Thus if an A n is defined as the approxima- 
tion of the nth observation vector then 



M 



n 



C . e . 
xn i 



, n= 1 , . . . ,N M < M 



i= 1 
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This allows for truncation of the noise by removing the 
terms with small variance (X) as shown by Preisendorf er and 
Barnett (197 7 ) . 




6.5 



The following is the sequence of operations/computer routines that were used 
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APPENDIX C 



4 




Figure Cl. 850 mb map types 1-6. Isopleths are 
D-values Cm) . 
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Figure C2 . 700 mb map types 1-6. Isopleths are O-values Cm). 
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Fig. C2 . cont . 700 mb map types 7-12. 
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APPENDIX D 




Figure Dl. Examples of 500 mb maps. Isopleths are D-values 
Cm) and "the classification by map type is indi- 
cated by the number above the map. 
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Fig. Dl. cont . Examples of map types 5-8. 
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